APPRC'.^O 


O.G. F!G. 


BY 


CLASS 


SUBCLASS 


DRAFTSMAN 







FIG. 1 



M. A. Cesare et al. 
ST9-99-037 
Sheet 1/10 



Server 4 



Database Program 8a 



Database 10 



Input 
Table 
14 



Output 
Table 
18 



y 



Clean Tansform Stored 
Procedure 
12 



Transform Rules 
16 



Client 6 



Database 
Program 8b 



Client Side of Clean 
Transform Stored 
Procedure 
20 



FIG. 3a 



M. A. Cesare et al. 
ST9-99-037 
Sheet 3/10 



find pattern 



replacement text value 



FIG. 3b 



find value 



replacement text value 



FIG. 4 



=1- 



n 

n 5 



lower bound 



upper bound 



replacennent value 



i. " 



FIG. 5 



lower bound 


lower replacement value 


upper bound 


upper replace value 



BY 

J DRAFTSMAN 



I DRAFTS 



CLASS 



SUBCLASS 





Receive clean parameters 

in API function call 
including definition sub- 
parameters. 



Access input data table indicated in 
Input data table name parameter. 




M. A. Cesare et al. 
ST9.99-037 
Sheet 4/10 




FIG. 6a 



For each rule definition in 
replacement rule definition(s) 
parameter, do: 



206 



C 3 S 

'S 



Access input data column indicated 
in input data column name. 


1 


r 



.208 



Access rule table from rule table 
name parameter and rule table 
columns including rules from rule 
table columns parameter. 



Sort Indicated mle table 
column according to sort key. 





Insert replacement value/pattem 
in column in rule table indicated in 
rule table columns parameter into 
any field in input data column 
matching file value/pattern. 



Go to block 204 if there are 
further rule definitions to process. 



242 



M. A. Cesare et al. 
ST9-99-037 
Sheet 5/10 



FIG. 6b 




If field in input data column >= 
(lower bound - tolerance), then do 
replacement in such field. 





M. A. Cesare et al. 
ST9-99-037 
Sheet 6/10 

FIG. 6c 



w 




270^ 



If field in input data coiunnn <= 
(upper bound + tolerance), then do 
replacement in such field. 




258 



If field in input data column 
<= (upper bound + tolerance) 
and >= (lower bound - 
tolerance), then do replacement 
in such field. 



Return to block 248, until all fields 
in input data column processed. 



^274 



Return to block 249 until all 
rules in rule table processed. 





f 




276 






Return to block 248, until all fields 
^in input data column processed. ^ 




Go to block 204 if there are further 
^ rule definitions to process. ^ 




Go to block 326 
in FIG. 6e 


► 


► 






3Q6 



M. A. Cesare et al. 
ST9-99-037 
Sheet 7/10 



3Q0 



Determine upper bound, lower 
bound, and replacennent value 
from first, second, and third 
columns in rule table indicated in 
rule table columns parameter. 





Yes> 



Pass, no 
replacements in any 
field in input data 
column 



For every field in input 
data column, do: 



324: 
FIG, 6e . 



5_J 



^0 



For every rule in rule 
table, do: 



308 



Yes- 





FIG. 6d 



Do not do 
replacement 




316. 



Yes- 



Is field in input data column <= 
(lower bound - tolerance), then 
replace with lower replace value. 




Yes- 



Is field in input data column >= 
(upper bound + tolerance), then 
replace with upper replace value 




322; 
FIG. 6e 



M. A. Cesare et al. 
ST9-99-037 
Sheet 8/10 



FIG. 6e 




If field in ihput data column <= (lower bound - 
tolerance) or >= (upper bound + tolerance), 
then use corresponding replacennent value. 



Return to block 307 until all 
rules in rule table processed. 




123 



312 




312 



Return to block 306, until all fields in 
input data column processed. 




326 



328 



Go to block 204 if there are 

further rule definitions to 
V process. > 




-Yes- 



Write modified 

input data 
column(s) to 
output data 
table. 



Write modified 

input data 
column(s) to 
input data table. 



M. A. Cesare et al. 
ST9-99-037 
Sheet 9/10 



FIG. 7 



CITY 


Product Name 


Product Category 


Sales ($) 


Unit Sales 


Varchar (20) 


Varchar(30) 


Varchar (15) 


Decimal(8,2j 


hiteger 


11%LAX455 


Pick Axe 


lATE 


5897.32 


1100 


SJC1298%SEA 


Small Shovel 


JXJ8972 


456.98 


43 


%lax6%SF094 


Tarp 


HAU1245 


893.12 


422 


%SJC34LAX45 


Flares 


AXlll 


1200.45 


1189 


4515%SEA 


NULL 


AC183 


333.9 


23 


3452%SJC34 


III! 


MIS222 


1818.4 


15002 



FIG. 8a 



Fiad Value 


Replace Value 


Varchar(10) 


Varchar(20) 


%%LAX% 


Los Angeles 


%%DFW% 


Dallas/Fort Worth 


%%SJC% 


San Jose, CA 


%%SFO% 


San Francisco 


%%SEA% 


Seattle/Tacoma 



FIG. 8b 



Lower Bound 


Lx)wer Replacement Upper Bound 


Upper Replacement 


Inteeer 


Inteeer Inteeer 


Inteeer 


100 


100 15000 


NULL 



M. A. Cesare et al. 
ST9-99-037 
Sheet 10/10 



FIG. 8c 



Sort Column 


Lower Bound 


Upper Bound 


Replace Value 


Integer 


Varchar(6) 


Varchar(6) 


VarcharflO) 


1 


I 


IZZZZ 


Special 


2 


HA 


JZ 


Hardware 


3 


LA 


NZ 


Miscel 


4 


AA 


BJ 


Winter Storm 


5 


RX 


NULL 


Winter Storm 



FIG. 8d 


Find Value 


Replace Value 




VarcharfS") 


VarcharClO) 




NULL 


III! 





